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Abstract 

In this study Box-Jenkins seasonal model is applied to records of mean flow to Dokan reservoir in the 
north of Iraq for the period from water year 1953/1954 to water year 2004/2005. Seven Box-Jenkins seasonal 
multiplicative models are fitted to this series. These are (l,l,l)x(l,l,l)i2 , (l,l,l)x(0,l,l)i2, 
(I,l,2)x(0,l,l) 12 ,(0,l,l)x(0,l,l)i2,(2,l,0)x(l,l,0)i2,(l,l,0)x(l,l,0)i2,(l,l,l)x(0,l,0)i2 models. The unconditional 
sum of squares method is used to estimate the parameters of the models and to compute the sum of square 
errors for each of the fitted model. It is found that the model which corresponds to the minimum sum of 
squared errors is the (l,l,l)x(l,l,l) 12 model with parameters 9=0.899,0=0.60,0=0.96, and O=0.03 . 
Portmanteau lack of fit test is used as diagnostic checking. Forecasts of monthly inflow for the period from 
October, 2005, to September,2007 are graphically compared with observed inflow for the same period and 
since agreement is very good adequacy of the selected model is confirmed. 
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Introduction 

A model which describes the probability structure of a sequence of observations is 
called a "stochastic process ". A time series of N successive observations z= ( zi Z2 z n ) 
is regarded as a sample realization, from an infinite population of such samples, which 
could have been generated by the process. An important class of stochastic processes is the 
stationary processes. They are assumed to be in a specific form of statistical equilibrium 
and in particular vary about a fixed mean. Particular stationary stochastic processes of 
value in modeling time series are the autoregressive ( abbreviated AR), moving average 
(abbreviated MA), and mixed autoregressive moving average processes (abbreviated 
ARMA). Another class of stochastic processes which is non stationary processes like 
autoregressive integrated moving average (abbreviated ARJJVIA ) models (Box and Jenkins, 
1976). Al-Suhaili (1986) used singlesite AR(1), autoregressive integrated moving 
average(ARIMA (1,0,1)) and (matalas model) for four Tigris river flow stations.. Abed 
(2007) applied Box- Jenkins seasonal multiplicative model of order (0, 1, 1) x (0, 1, 1) 12. to 
monthly records of some physical and chemical properties of river water in Babylon, Najaf, 
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and Diwaniya governorates . Al-Ta'ee (2009) applied ARIMA models to records of rainfall 
and evaporation at Babylon governorate. Ali (2009) fitted three Box- Jenkins seasonal 
multiplicative models to monthly inflow to bekhem reservoir. These are (l,l,0)x( 1,1,0) n , 
(2, l,0)x( 1,1,0) i2,(0, l,l)x(0,l,l)i2.It is found that the model which corresponding to the 
minimum sum of squared error is the ,(0,l,l)x(0,l,l)i2- 

In this study we fit other types of models which are the seasonal models called Box- 
Jenkins models to inflow of Dokan reservoir. Application of these seasonal models to data 
from Iraq is not found in the literature. 



Stochastic Processes 

Box and Jenkins (1976) have generalized the autoregressive integrated moving 
average ARIMA (p,d,q) model to deal with seasonality and define a general multiplicative 
seasonal model in the form : 

^ p (^)o P (^ ! )w t =e q (^)0 Q (^ ! )a t (1) 



Where: 

(J) ,0=autoregressive parameters 
0 ,0=moving average parameters 

P=backward shift operator such that: 

pZ t =Zn,and p s Z t =Z t . s , 

Z t =lnX t , 

X t =observed inflow (m 3 /sec), 
W t = V d Vf 2 Z t ,and 

^ =difference operators such that: 

V d z ( = z,-z,_ d , 



•(2) 



V 12 ^1 ~ ^ t-UD ' 

D,d=degrees of differencing, 

s= number of seasons (=12 for monthly data), and a t 's are drawings from a distribution of 
zero mean and constant variance ,i.e., white noise. 

This model ,i.e., Equation 1, is said to be of order (p,d,q)x(P,D,Q) s . For example, if 
p=P=l, d=D=l, and q=Q=l, equation 2 becomes: 
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(i - </>p)(i - op 12 )w, = (i - epxi - ep 12 )a t 



•(3) 



This model is called Box-Jenkins seasonal multiplicative model or Box-Jenkins 
seasonal moving average model of order (1, 1, 1) x (1, 1, l)i2.By the same way we 
obtained the equations of the other models and the results are listed in table below. 



Model 


Model Equation 


(1,1,1)X(1,1,1)12 


a - #f xi - op 12 )w t = a - ep){\ - ep 12 )a, 


(l,U)x(0,l,l)i2 


{\-cj>p)v t ={i-ep%-ep n )a t 


(l,l,2)x(0,U)i2 


(i-^)w ( = (i-^-^ 2 Xi-©^ 12 K 


(0,l,l)x(0,l,l)i2 


w t ={\-ep\\-ep l2 )a t 


(2,l,0)x(l,l,0)i2 


(l-^.P-^P^l-OP^^a, 


(1,1,0)X(1,1,0)12 


(l-<f>P)(l-Q>p n )V t =a t 


(l,l,l)x(0,l,0)i2 


{\-<j>p)w t ={\-ep)a t 



The main objective of the present study is to apply the Box and Jenkins model to 
monthly in flow to Dokan reservoir in the north of Iraq. 



Inflow to Dokan Reservoir 

The Dokan dam is located at about 60km from the north west of Al-Sulaimania 
town and at about 300 km from Kerkok governorate as shown in figure l).The main 
purposes of the Dokan project are to store and regulate the abundant water of the Lesser 
Zab River, a tributary of the Tigres River, by creating a large scale reservoir, to supply 
irrigation water required in the area downstream of the dam, and to control discharges 
downstream by impounding and regulating floods. 

In addition to the abovementioned purposes, the discharge and head obtained by the 
dam are to be utilized for power generation for effective use of hydraulic energy, 
thereby making this a multi-purpose, for irrigation, flood control, and power generation. 

In figure 2 we plot the means of inflows to Dokan reservoir for the entire period of 
record of 48 years from Oct., 1953 to Sep. ,2004. To show the periodic behavior of this series 
only a small part of it corresponding to the period from water year 1953/1954 to water year 
1957/1958 is given in figure 3 which, as expected, shows a marked seasonal pattern. 



Fitting Box -Jenkins Models to Dokan Inflow 

The seasonal ARIMA (p,d,q)x( P,D,Q) model -which is called "seasonal multiplicative 
autoregressive integrated moving average model "or Box - Jenkins seasonal model -is to 
fitted to a time series by using a three-stages procedure. These stages are model 
identification, estimation of model parameters and diagnostic checking of the estimated 
parameters. 

First of all the observations are plotted against time this will show up important features 
such as trend, seasonality, discontinuities and outliers. Figures 2 show that there is little 
trend ,high fluctuation, and seasonal variation for monthly inflow. 

The procedure of fitting is summarized by the following steps(Box and Jenkins, 1976): 
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1. Transform data using natural log transformation which was found the most appropriate. 

2. Removing trend component by using the first order differencing. 

3. Removing the seasonal variation by using the first order seasonal differencing. 

4. Model identification by plotting ACF and PACF of monthly observations. 

Identification of Representative Models 

By identification it is meant the use of the data, and of any information on how the 
series was generated, to suggest a subclass of models from the general Box- Jenkins family , 
i.e., Equation 1 for further examination. In other word, identification provides clues about 
the choice of the order of p ,d ,q ,P ,D ,and Q. However, in practice the degrees of 
differencing d and D are assumed 1 while autocorrelation and partial autocorrelation 
function are plotted to guess the order of p ,q ,P ,and Q. 

The estimation autocorrelation and partial autocorrelation function as shown in 
figures 4 and 5 respectively are characterized by correlations and autocorrelations which 
alternate in sign and which tend to damp out with increasing lags. 

Estimation of the Model Parameters 

The unconditional sum of square method is used to estimate the model parameters . 
Trial values for the parameters of the (l,l,l)x(l,l,l)i 2 model ,i.e. ,0 ,<b ,0 ,and O in 
Equation 3 are assumed and the sum of squares ,SS , is computed. Computation of SS is 
repeated with different values of 0 ,(f) ,0 ,and O until minimum sum of squares is obtained. 
To illustrate this procedure results are given in Table 1 for 0=0.899,0=0.60,0=0.96, and 
0=0.03 corresponding to the minimum sum of squares of 78.40. 

The number of observations is N=624 , n=N-d-SD =624-1-12= 611 is the number 
of dependent stochastic component (W t ), d is the degree of simple difference (d=l), 
D is the degree of seasonal difference (D=l), and S is the length of periodic cycle (S=12). 
The values of both 0 ,(J) ,0 ,and O vary from -1 to 1. We choose 0 =4>= 0= 0=-l as trial 
values and sum of square (SS) is computed. The procedure is repeated with different values 
of 0 ,(}) , 0 , and O until we obtain minimum sum of squares. Table 1 shows calculation 
corresponding to specific assumed values of 6 ,4> , 0 , and O. Then, the model may be 
written in either forward form as shown in the following equation(Box and Jenkins, 1976): 

[aJ=[wJ-^[W ( _J-O[W r _ 1 J+^[W ( _ 1 3] + 0[a ( _J + 0[a r _ 1 J-^[a r _ 13 ] (4) 

or backward form as follows: 

k]=[wj-*[wj-0[w„j+^^ (5) 

Hence, it is convenient to use a numbering system so that the first observation in the 
X t series (the second column in table 1) has a subscript -12, the last observation has a 
subscript 611, .The beginning of calculation is done by equation 5,i.e. when t=6 1 1 , to 

compute the [ej's (column 6) for t=6 1 1 , 610, ,1 by setting [e t ] =0 for t=612 , 613,and so 

on .Then the [W t ] series for t= 0,-1, -2,...., -12 is computed by equation (5) by setting [e t ] 
=0 for t=0, -1,...., -12. After that [aj's ( in fourth column ) are computed by equation 4 for 

t= -12 ,-11, ,-611 by setting [a t ]'s=0 for t=-13,-14,and so on. Hence ,the sum of 

squares(SS) is computed by the following equation: 

t=6U 

SS= Y a < ( 6 ) 

t=-\2 

Table 1 shows that the (SS=78.40 when (b=0.60, 0=0.03,6=0.899, and 0=0.96). Therefore, 
the model equation is: 

W t (l - 0.60/? )(l - 0.03 j3 12 )= (1 - 0.899 j3)(l - 0.96 j3 12 )a t (7) 
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The results of application of the unconditional sum of squares method to estimate the 
parameters for the all models are listed in table below: 



Model 


Parameters 


SS 


Oi 


e 2 


*i 


4>2 


0 


0 


(l,l,l)x(l,l,l) 12 


0.899 




0.600 




0.960 


0.030 


78.40 


(U,1)X(0,1,1)12 


0.870 




0.569 




0.955 




78.54 


(U,2)X(0,1,1)12 


-0.380 


0.390 


-0.758 




0.910 




79 


(0,1,1)X(0,1,1)12 


0.380 








0.926 




80 


(2,l,0)x(l,l,0)i2 






-0.370 


-0.220 




-0.500 


106 


(1,1,0)X(1,1,0)12 






-0.322 






-0.501 


111 


(l,l,l)x(0,l,0)i 2 


0.501 




0.122 








140 



Diagnostic Check 

After estimating the model parameters, the diagnostic checking is applied to see if the 
model is adequate or not. Therefore the following statistical tests are used: 

1. Port Manteau Lak of fit Test 

Portmanteau lack of fit test is used for this purpose. It is a test of the residual 
independency and uses the Q-statistic defined as : 

M 

Q=(N-d-DS)Y j r k 2 a t (8) 

k=l 

where r k (a t )is the autocorrelation coefficient of the residual (a t ) at lag k ,and M is the 
maximum lag considered (about N/4) (Chatfield,1982),SARIMA model is considered 
adequate if Q<% 2 a ,(M-n P ) where a is the level of significant , np is the number of model 
parameters, and the expression (M-np) represents the degree of freedom. 

The results of this test indicate that model is adequate because the calculated Q= 
(157.79) less the % 2 - table (188.01) with 152 degree of freedom (M-P-p-Q-q) at 97.5% 
confidence limits where M=156 for monthly data. 

2. Residual autocorrelation Function (RACF) Test 

The second test is the independency of the resulting (a t ) series, the correlogram of 
this series are computed for lag (M=N/5) are shown in figure (5) . The figure shown that the 
most of computed lags lie inside the tolerance interval (+2/VN, at 95% confidence limits). 
Hence, the suggested model can be considered as appropriate model because of its 
capability of removing the dependency from data. 

Forecasting 

Forecasted monthly data are computed for 3 years a head of original data for the 
period from 2005 to 2007 by applying the following equation (Box and Jenkins, 1976): 

T^it) =Z t ^ ( =Z MA +Z t _ h(A2 —Z t ^_ l?i +a t+f —(h t+ (_4 — Q3^_i2+^H+M3 + ^f-rf-i +< I ) W-rf-i2 — (9) 

where t is the origin time (t=624) and ^ is the lead time (=l,2,...36).After obtaining the 
forecasted series (Z t for t=625,626,627....660) ,then the final series (X t ) is determined by 
reversing (In) transformation .Figure 6 show the forecasted series for these data. The 
corresponding observed values are also shown in the figure and since agreement between 
observed and forecasted values is very good, it is confirmed that the model is adequate. 
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DOKAN 



PROJECT 



Figure 1: Sit Map of Dokan Reservoir. 
(Planning Report on Dokan Dam Project,2007) 
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Confidence Limits 



Figure 4: ACF and PACF of the series. 
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Table 1: Sum of squares(SS) computation for (l,l,l)x(l,l,l) 12 model with 
$=0.60, 0=0.03, 6=0.899, and 0=0.96. 



t(month) 


x t 


Z t =lnX t 


a t 


W t 


e t 


a 2 


-12 


40 


3.688879 


-0.09813 


-0.098135 


0 


0.00963 


-11 


42 


3.73767 


-0.43232 


-0.40298 


0 


0.186902 


-10 


106 


4.663439 


0.357215 


0.50408 


0 


0.127602 


-9 


209 


5.342334 


0.321455 


0.30277 


0 


0.103333 


-8 


441 


6.089045 


0.436095 


0.32877 


0 


0.190179 


-7 


670 


6.507278 


0.230599 


0.03581 


0 


0.053176 


-6 


1750 


7.467371 


1.027137 


0.84131 


0 


1.05501 


-5 


263 


5.572154 


-0.97608 


-1.3947 


0 


0.952741 


-4 


161 


5.081404 


0.231822 


0.27251 


0 


0.053742 


-3 


95 


4.553877 


0.099736 


0.054832 


0 


0.009947 


-2 


57 


4.043051 


-0.15116 


-0.20793 


0 


0.02285 


-1 


42 


3.73767 


-0.14414 


-0.13300 


0 


0.020778 


0 


40 


3.688879 


-0.11905 


0.0212 


0 


0.014173 
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-0.14911 
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0.022235 


605 


415 


6.028279 


-0.29523 


-0.58367 


-0.41977 


0.087159 


606 


325 


5.783825 


-0.33382 


-0.1037 


0.113101 


0.111433 


607 


330 


5.799093 


0.459118 


0.6893 


0.701179 


0.21079 


608 


120 


4.787492 


-0.16698 


-0.2544 


-0.15652 


0.027883 


L 609 


60 


4.094345 


-0.08299 


0.0764 


0.159818 


0.006887 


610 


40 


3.688879 


-0.15382 


0.18232 


0.214503 


0.023661 


611 


42 


3.73767 


0.040858 


0.10763 


0.107631 


0.001669 




0 


SS=78.40 
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Confidence Limits 



1 15 29 43 57 71 85 99 113 
8 22 36 50 64 78 92 106 120 

Lag Number 

Fig. (5): Autocorrelogram of residual series parameter. 
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Figure 6: Comparison of Forecasted and Observed Inflow 
(Oct.2005-Sep.2007). 



Conclusions 

The(October 1953-September 2004) time series of monthly inflow to Dokan 
reservoir is a periodic series and , therefore, the stochastic model which represents it is a 
seasonal one. Seven of such models are fitted to the series and it is found that sum of 
squared errors of the (l,l,l)x(l,l,l)i 2 model with autoregressive moving average 
parameters of 6=0.899,0=0.60,0=0.96, and 0=0.03 is less than the 
Other(l,l,l)x(0,l,l)i2,(l,l,2)x(0,l,l)i2,(0,l,l)x(0,l,l)i2,(2,l,0)x(l,l,0)i2,(l,l^ 
(l,l,l)x(0,l,0)i 2 models. The diagnostic checking show that model is adequate. Forecasts 
using the model for the period from October, 2005 to September 2007 agrees well with the 
observed values. 
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Notations 




ACF 


autocorrelation function 


D,d 


degrees of differencing 


PACF 


partial autocorrelation function 


SS 


sum of square errors 


s 


number of seasons 


X 


observed value 


Z 


lnX 


4> ,o 


autoregressive parameters 


6,0 


moving average parameters 


P 


backward shift operator 


V 


backward difference operator 
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